How Realistic is Artificially Added Noise?
نویسنده
چکیده
Evaluations of algorithms for robust automatic speech recognition (ASR) are often based on artificial noisy speech instead of realistic noisy speech. In this paper we compare the ASR performance of speech with artificial additive noise to the performance of realistic noisy speech. All data was recorded during the same recording campaign and with nearly identical channel characteristics. The simulation process takes into account all major characteristics of the noisy reference data. Clean speech, noisy speech and simulated speech are compared for different aspects of robust ASR including noise reduction by Spectral Subtraction and the ETSI robust front end. The results show, that artificial noisy speech even in very controlled simulation environments is not very similar and not a full substitute for realistic noisy data. While the tendencies of the improvement for artificial and realistic data are similar for the evaluated approaches, the magnitude can be quite different.
منابع مشابه
A Comparative Study of Model Compensation Methods for Robust Speech Recognition in Noisy Conditions
Background noise can cause severe degradation of performance for speech recognition systems. Robustness towards background noise can be achieved by applying model-based compensation approaches. For systems that use MFCC features the relationship between noise, speech, and resulting noisecorrupted speech is non-linear, and an important aspect of model-based approaches is how to approximate this ...
متن کاملEffects of microarray noise on inference efficiency of a stochastic model of gene networks
The appearance of microarray technology led to the development of algorithms to infer the structure underlying the dynamics of gene regulatory networks (GRNs) from gene expression data. Yet, this technique is currently highly noisy, leading to the question of how inferable are GRNs from this data. To answer this question, we use realistic models of GRNs [1, 2, 3] and artificially introduce real...
متن کاملEntropies of the Eeg: the Effects of General Anaesthesia
The aim of this paper was to compare the performance of different entropy estimators when applied to EEG data taken from patients during routine induction of general anesthesia. The question then arose as to how and why different EEG patterns could affect the different estimators. Therefore we also compared how the different entropy estimators responded to artificially generated signals with pr...
متن کاملMinimax classification with parametric neighborhoods for noisy speech recognition
In this paper we derive upper and lower bounds on the mean of speech signals corrupted by additive noise. The bounds are derived in the log spectral domain. Approximate bounds on the first and second order time derivatives are also developed. It is then shown how to transform these bounds to the MFCC domain to be used by conventional cepstrum-based speech recognizers. The proposed bounds define...
متن کاملEditorial: Using Noise to Characterize Vision
Auditory noise is a sound, a random variation in air pressure. More generally, random " noise " can be introduced into any stimulus, including a visual display. Noise added to the stimulus can probe the computations underlying perception of the stimulus. With power and precision, the noise, by restricting the information available, places fundamental constraints on attainable performance and pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011